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Abstract 

Given an undirected graph G, a collection {(si,fi), . ■ ■ , (sk,tk)} of pairs of vertices, and an integer 
p, the Edge Multicut problem ask if there is a set S of at most p edges such that the removal of S 
disconnects every Si from the corresponding f,. VERTEX MULTICUT is the analogous problem where S 
is a set of at most p vertices. Our main result is that both problems can be solved in time 2°' p > ■ n°^\ 
i.e., fixed-parameter tractable parameterized by the size p of the cutset in the solution. By contrast, it is 
unlikely that an algorithm with running time of the form f(p) ■ n ^ exists for the directed version of the 
problem, as we show it to be W[l]-hard parameterized by the size of the cutset. 

1 Introduction 

From the classical results of Ford and Fulkerson on minimum s — t cuts lfl8l to the more recent 0{^\ogn)- 
approximation algorithms for sparsest cut problems ll38l [Tl [161. the study of cut and separation problems 
have a deep and rich theory. One well-studied problem in this area is the EDGE MULTICUT problem: given 
a graph G and pairs of vertices (s\,t\), . . . , (st,tk), remove a minimum set of edges such that every Si is 
disconnected from its corresponding f,- for every 1 < i < k. For k=l, EDGE MULTICUT is the classical s — t 
cut problem and can be solved in polynomial time. For k = 2, EDGE MULTICUT remains polynomial-time 
solvable KOI , but it becomes NP-hard for every fixed k > 3 US. EDGE MULTICUT can be approximated 
within a factor of O(logfc) in polynomial time |fl9l (even in the weighted case where the goal is to minimize 
the total weight of the removed edges). However, under the Unique Games Conjecture of Khot ll26ll . no 
constant factor approximation is possible [7 ]. One can analogously define the Vertex Multicut problem, 
where the task is to remove a minimum set of vertices. An easy reduction shows that the vertex version is 
more general than the edge version. 

Using brute force, one can decide in time n ^ if a solution of size at most p exists. Our main result is 
a more efficient exact algorithm for small values of p (the O* notation hides factors that are polynomial in 
the input size): 

Theorem 1.1. Given an instance of Vertex Multicut or Edge Multicut and an integer p, one can 
find in time 0*(2°( p ') a solution of size p, if such a solution exists. 

That is, we prove that Vertex Multicut and Edge Multicut are fixed-parameter tractable parame- 
terized by the size p of the solution, resolving a very challenging open question in the area of parameterized 
complexity. (Recall that a problem is fixed-parameter tractable (FPT) with a particular parameter p if it 
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can be solved in time f(p) -n ^\ where / is an arbitrary computable function depending only on p; see 
IPT51 [171 l33l for more background). The question was first asked explicitly perhaps in 11281 : it has been 
restated more recently as an open problem in e.g., ll22l HI. Our result shows in particular that multicut is 
polynomial-time solvable if the size of the optimum solution is 0(J/logn) (where n is the input size). 

One reason why multicut is a fundamental problem is that it is able to express several other problems. It 
has been observed that a correlation clustering problem called Fuzzy Cluster Editing can be reduced to 
(and in fact, equivalent with) Edge Multicut EdUHl. Our results show that Fuzzy Cluster Editing 
is FPT parameterized by the editing cost, settling this open problem discussed e.g., in 0. 

Previous work. The fixed-parameter tractability of multicut and related problems has been thoroughly 
investigated in the literature. EDGE MULTICUT is NP-hard on trees, but it is known to be FPT, parame- 
terized by the maximum number p of edges that can be deleted, and admits a polynomial kernel ll5l |23"1. 
Multicut problems were studied in ll22l for certain restricted classes of graphs. For general graphs, Vertex 
Multicut is FPT if both p and and the number of terminal pairs k are chosen as parameters (i.e, the prob- 
lem can be solved in time f(p,k)-n°W (29l [39j HD for some function /). The algorithm of Theorem ll.ll is 
superior to these result in the sense that the running time depends polynomially on the number of terminals, 
and the exponential dependence is restricted to the parameter p. For the special case of MULTIWAY Cut 
(where terminals in a set T have to be pairwise separated form each other), algorithms with running time of 
the form f(p) ■ n°^> were already known I291 I81I2T1. but apparently these algorithms do not generalize in an 
easy way to multicut. An FPT 2-approximation algorithm was given in [30] for EDGE MULTICUT: in time 
0*(2 ( pl ° 8P >), one can find a solution of size 2p if a solution of size p exists. There is no obvious FPT al- 
gorithm for the problem even on bounded-treewidth graphs, although one can obtain linear-time algorithms 
if the treewidth remains bounded after adding an edge j$ for each terminal pair Il20ll34l . A PTAS is known 
for bounded-degree graphs of bounded treewidth (6). 

Our techniques. The first two steps of our algorithm follows QUI . We start by an opening step that 
is fairly standard in the design of FPT algorithms. Instead of solving the original Vertex Multicut 
problem, we solve the compression version of the problem, where the input contains a solution W of size 
p + l, and the task is to find a solution of size p (if exists). A standard argument called iterative compression 
11371 l25l shows that if the compression problem is FPT, then the original problem is FPT. Alternatively, we 
can use the polynomial-time approximation algorithm of Gupta 11241 . which produces a solution W of size 
p if a solution of size p exists. In this case, 0(p 2 ) iterations of the compression algorithm gives a solution 
of size p. 

Next, as in QUI , we try to reduce the compression problem to ALMOST 2S AT (delete k clauses to make 
a 2-CNF formula satisfiable; also known as 2CNF Deletion), which is known to be FPT ll36l [121 l35l . 
However, our 2SAT formulation is very different from the one in 11301 : we introduce a single variable x v 
only for each vertex of G, while in ll30l there is a variable x VM , for every vertex v G V (G) and vertex w £W 
of the initial solution. This simpler reduction to ALMOST 2SAT is correct only if the instance satisfies two 
quite special properties: 

(1) every component of G \ W is adjacent to at most two vertices of W ("has at most two legs"), and 

(2) there is a solution S such that every component of G \ S contains a vertex of W ("no vertex is isolated 
from W after removing the solution S" or "no vertex is in the shadow of 5"'). 

The main part of the paper is devoted to showing how these properties can be achieved. In order to achieve 
property (1), we show by an analysis of cuts and performing appropriate branchings that the set W can be 
extended in such a way that every component has at most two legs (Section HI). To achieve property (2), 
we describe a nontrivial way of sampling random subset of vertices such that if we remove this subset by a 
certain contraction operation (taking the torso of the graph), then without changing the solution, we get rid 
of the parts not reachable from W with some positive probability (Section[3]). This random sampling uses the 
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concept of "important separators," which was introduced in |29l , and has been implicitly used in Il9ll36ll8l 
in the design of parameterized algorithms. We consider the random sampling of important separators the 
main new technical idea of the paper. This technique has turned out to be useful for other problems as well 
iflOl l27l and we expect it to have further application in the future. 

Directed graphs. Having resolved the fixed-parameter tractability of Vertex Multicut, the next 
obvious question is what happens on directed graphs. Note that for directed graphs, the edge and vertex 
versions are equivalent. In directed graphs, multicut becomes much harder to approximate: there is no 
polynomial-time 2 log "-approximation for any e > 0, unless NP C ZPP ifTTl . From the fixed-parameter 
tractability point of view, the directed version of the problem received particular attention because DI- 
RECTED Feedback Vertex Set or DFVS (delete p vertices to make the graph acyclic) can be reduced to 
Directed Multicut. The fixed-parameter tractability of DFVS had been a longstanding open question 
in the area of parameterized complexity until it was solved by Chen et al. @ recently. The main idea that 
led to the solution is that DFVS can be reduced to a variant (in fact, special case) of DIRECTED MUL- 
TICUT called Skew Multicut, where the task is to break every path from s t to tj for every i > j. By 
showing that Skew Multicut if FPT parameterized by the size of the solution, Chen et al. @ proved 
the fixed-parameter tractability of DFVS. We show in Section [6] that, unlike Skew Multicut, the general 
Directed Multicut problem is unlikely to be FPT. 

Theorem 1.2. Directed Multicut is Vf[l]-hard parameterized by the size p of the solution. 

Independent and follow up work. A preliminary version of this paper appeared in OTl ; the current 
version contains essentially the same algorithm, but the terminology and organization of Section [5] were 
significantly changed. Independently from our work, Bousquet et al. [4J presented in the same volume a 
proof that MULTICUT is FPT parameterized by the size p of the solution. The two algorithms have certain 
parts in common: both reduce the problem to the compression version and both ensure that we have to 
deal with components having only two legs. However, the main part of the two algorithms are substantially 
different: the current paper introduces the technique of random sampling of important separators and uses 
it to reduce the problem to ALMOST 2SAT, while Bousquet et al. H uses an approach based on a series of 
problem-specific reductions to reduce the problem to 2SAT. 

Subsequently to the first version of this paper, random sampling of important separators has been used 
in two other applications. Even though DIRECTED MULTICUT is W[l]-hard parameterized by p (see Sec- 
tion ©, Chitnis et al. OH proved that the special case DIRECTED MULTIWAY Cut (Given a set T of 
terminals, break every directed path between two different terminals by removing at most p edges/vertices) 
is FPT parameterized by p. A consequence of this result is that DIRECTED MULTICUT with k = 2 is FPT pa- 
rameterized by p is FPT. The complexity of DIRECTED MULTICUT for k = 3 or with combined parameters 
k and p remains an interesting open question. 

A very different application of the technique is given by Lokshtanov and Marx fTj\ in the context of 
clustering problems. They study a family of clustering problems such as partitioning the vertices of an 
undirected graph into clusters of size at most p such that at most q edges leave each cluster. The problem 
boils down to being able to check whether a given vertex v is contained in such a cluster. It turns out that 
the random sampling of important separators technique can be used to show that this task (and therefore the 
original clustering problem) is FPT parameterized by q by reducing it to a knapsack-like problem. 

2 Framework: compression, shadows, legs 

Let G be an undirected graph and let T = {(ji,^), . .. , (sk,tk)} be a set of terminal pairs. We say that a set 
S C V(G) of vertices is a multicut of (G,T) if there is no component of G\S that contains both Sj and tj 
for some 1 < i < k (note that it is allowed that S contains Sj or f,-). The central problem of the paper is the 
following: 
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Vertex Multicut 


Input: 


A graph G, an integer p, and 




a set T of pairs of vertices of G 


Output: 


A multicut of (G,T) of size at most p 




or "NO" if no such multicut exists. 



We prove the fixed-parameter tractability of Vertex Multicut by a series of reductions (see Figure [T|). 
First we argue that it is sufficient to solve an easier solution compression problem. Then following two 
reductions modify the problem in such a way that it is sufficient to look for solutions that are shadowless 
and we can assume that the instance is bipedal. The last step of the proof is reducing this special variant of 
the problem to ALMOST 2SAT. 

2.1 Compression 

The first step in the proof of Theorem ll.ll is a standard technique in the design of parameterized algorithms: 
we define and solve the compression problem, where it is assumed that the input contains a feasible solution 
of size larger than p. As this technique is standard (and in particular, we follow the approach of QUI for 
Edge Multicut), we keep this section short and informal. 



Multicut Compression 


Input: 


A graph G, an integer p, 




a set T of pairs of vertices of G, and 




a multicut W of (G,T) 


Output: 


A multicut of (G,T) of size at most p, 




or "NO" if no such set multicut exists. 



Our main technical contribution is showing that MULTICUT COMPRESSION is FPT parameterized by p and 
\W\. 

Lemma 2.1. Multicut Compression can be solved in time 0*(2 o ((p+ lo gW) 3 +W lo gW)). 

Intuitively, it is clear that proving Lemma [2TT1 could be easier than proving that Vertex Multicut 
is FPT: the extra input W can give us useful structural information about the graph (and as \W\ appears in 
a running time, a large W is also helpful). What's not obvious is how solving Multicut COMPRESSION 
gives us any help in the solution of the original Vertex Multicut problem. We sketch two methods. 

Method 1. Let us use the polynomial-time approximation algorithm of Gupta [24] to find a multicut 
W of size at most c ■ OPT 2 , where c is a universal constant and OPT is the minimum size of a multicut. If 
| W | >c-p 2 , then we can safely answer "NO", as there is no multicut of size at most p. Otherwise, we run 
the algorithm of LemmaOfor this set W to obtain a solution in time O* (2°^P +lo ^ w ^ = 0*{2 ^). 

Method 2. The standard technique of iterative compression i37l |25| allows us to reduce Vertex 
Multicut to at most \V(G)\ instances of Multicut Compression with \W\ =p+l. This technique was 
used for the 2-approximation of EDGE MULTICUT in lf30ll and its application is analogous in our case. Let 
(G,T,p) be an instance of Vertex Multicut. Suppose that V(G) = {v\,.. . ,v„}, let G; = G[{vi,...,v;}], 
and let T, be the subset of T containing the demands with both endpoints in G,. One by one, we consider 
the instances (G;,Ti,/?) in ascending order of i, and for each instance we find a solution 5; of size at most 
p. We start with So = 0. For some i > 0, we compute S, provided that 5,-_i is already known. Observe 
that S,_i U{v,} is a multicut of size p + l for (G;,T,-). Thus we can use the algorithm for MULTICUT 
Compression, which either returns a multicut S; of (G,-,T,) having size at most p or returns "NO". In the 
first case, we can continue the iteration with i+ 1. In the second case, we know that there is no multicut of 
size p for (G,T) (as there is no such multicut even for (G,-,T,)), and hence we can return "NO". 
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Iterative compression or approximation 
(Section |2) 



Multicut Compression* 



Random sampling of important separators 
(Section O 



Multicut Compression* 
(shadowless solution) 



Branching on shattering sets 
(Section |Hi 



Bipedal Multicut Compression* 
(shadowless solution) 



Encoding into 2SAT 
(Section 01 



Almost 2SAT 



Previous work 



t 

FPT 



Figure 1 : The chain of reductions in the paper. 
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Both methods result in 0*{2 0<ypi ^) time algorithms. However, we feel it important to mention both 
approaches, as improvements in Lemma [2j] might have different effects on the two methods. 

It will be convenient to work with a slightly modified version of the compression problem. We say that 
a set S C V(G) is a multiway cut of W C V(G) if every component of G\ S contains at most one vertex of 
W. 



Multicut Compression* 


Input: 


A graph G, an integer p, 




a set T of pairs of vertices of G, and 




a multicut W of (G,T) 


Output: 


A set S of size at most p such that 




(1) 5 is multicut of (G,T), 




(2) SDW = <d, and 




(3) 5 is a multiway cut of W 




or "NO" if no such set S exists. 



That is, Multicut Compression* has two additional constraints on the solution S. In Sections [SHU we 
prove that this problem is FPT: 

Lemma 2.2. Multicut Compression* can be solved in time O* {2°^ p+Xo ^ w ^). 

It is not difficult to reduce MULTICUT COMPRESSION to MULTICUT COMPRESSION* (an analogous 
reduction was done in [30] for the the edge case). We briefly sketch such a reduction. In order to solve an in- 
stance (G,T,W,p) of Multicut Compression, we first guess the intersection X of the multicut W given 
in the input and the solution 5 we are looking for. This guess results in at most Y%=\ \i ) branches; in each 
branch, we remove the vertices of X from G and decrease p by \X\. Thus in the following, we can restrict our 
attention to solutions disjoint from W. Next, we branch on all possible partitions (W\ ,W t ) ofW, contract 
each W,- into a single vertex, and solve MULTICUT COMPRESSION* on the resulting instance (G',T', W',p'). 
One of the partitions {W\ ,W t ) corresponds to the way the solution S partitions W into connected com- 
ponents, and in this case S is a multiway cut of W' in G'. Thus if the original MULTICUT COMPRESSION 
instance has a solution S, then it is a solution of one of the constructed MULTICUT COMPRESSION* in- 
stances. Conversely, any solution of the constructed instances is a solution of the original instance. As the 
number of partitions of W can be bounded by [VKl ^^, the running time claimed in Lemma |2~T1 follows 
from Lemma [2T2l Thus in the rest of the paper, it is sufficient to prove Lemma I2T21 to obtain the main result 
Theorem ll.il 

2.2 Shadows 

Let / = (G,T,W,p) be an instance of the MULTICUT COMPRESSION* problem, and let S be a solution for 
/. The shadow of the set S is the set of vertices not reachable from any vertex of W in G \ S (intuitively, we 
imagine the vertices in W as light sources, light spreads on the edges, and S blocks the light). We say that 
the solution 5 is shadowless if the shadow is empty, i.e., G\S has exactly \W\ components. In Section |3l 
we present a randomized algorithm that modifies the instance such that if a solution exists, then it makes 
the solution shadowless with positive probability. The algorithm is based on a randomized contraction 
of sets defined by "important separators"; we review this concept in Section 13.31 The algorithm can be 
derandomized to obtain the following lemma: 
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Lemma 2.3 (shadowless reduction). Given an instance I of the MULTICUT COMPRESSION* problem, we 
can construct in time O* (2°( p >) a set of t = 2°( p ' logn instances I\, . .., I t , each with the same parameter 
p as I, such that 

1. Any solution of li for any 1 <i <t is a solution of I. 

2. If I has a solution, then 7, has a shadowless solution for at least one 1 < i < t. 

Thus Lemma l231 allows us to reduce the MULTICUT COMPRESSION* problem into a variant where the 
task is to rind a shadowless solution. 

2.3 Components and legs 

In order to find a shadowless solution for a MULTICUT COMPRESSION* instance, the problem is fur- 
ther transformed in Section 0] using the concept of legs. Given an instance (G,T,W,p) of MULTICUT 
Compression* , we say that a component C of G \ W has i-legs if C is adjacent with i vertices of W. We 
say that an MULTICUT COMPRESSION* instance is bipedal if every component of G\W has at most two 
legs; Bipedal Multicut Compression* is the problem restricted to such instances. The transformation 
presented in Section 0] reduces MULTICUT COMPRESSION* to a bounded number of bipedal instances. 

Lemma 2.4 (bipedal reduction). Given an instance I of the MULTICUT COMPRESSION* problem with pa- 
rameter p, we can construct in time 0*(2°( p ^) a set oft = 2°^ p+lo ^ w ^ BIPEDAL MULTICUT COMPRESSION* 
instances I\, I t , each with parameter at most p 

1. Any solution of li for any 1 <i <t is a solution of I. 

2. If I has a shadowless solution, then Ij has a shadowless solution for at least one 1 < i < t. 

Finally, in Section \5\ we show how this solution be found by a quite intuitive reduction to an FPT 
problem ALMOST 2SAT. 

Lemma 2.5. Let I = (G,T,W,p) be an instance of Bipedal Multicut Compression* that has a shad- 
owless solution S of size at most p. In time 0*(4 P ), we can find a (not necessarily shadowless) solution 
S'. 

Combining Lemmas I23T42. 51 allows us to prove Lemma l2T2l and therefore to solve Vertex Multicut. 

Proof (of Lemma l22l ). Let us apply the Algorithm of Lemma [231 to an instance I = (G,T, W,p) of MUL- 
TICUT Compression*. This algorithm takes time 0*(2 0( - p3 ^) and produces t = 2 0( - p3 Hogn instances 7, of 
the Multicut Compression* problem, each with parameter at most p, so that the original instance I has 
a solution if and only if one of these t instances has a shadowless solution. Moreover a (not necessarily 
shadowless) solution of any of these instances is also a solution of the orginal instance. 

Apply to each instance /; the algorithm of Lemma l2.4[ which in time O* (2°^ p+log ^ w ^ ') either returns an 
answer or produces 2°^ p+log ^ w ^ > instances 7/j, each with parameter at most p, of the Bipedal Multicut 
COMPRESSION* problem such that 7, has a shadowless solution if and only if at least one 7,-j has a shadowless 
solution. Moreover a (not necessarily shadowless) solution of any new instance I(j is also a solution of 7,. 

Combining the above two steps, we conclude that in time O* (2°^ (f + lo g I w I ) 3 ) ) the algorithm produces 
20((p+iog|w|) 3 )j g W j nstances f t h e Bipedal Multicut Compression* problem such that the original 
instance I has a solution if and only if at least one of the these 2°^ p+log ^ w ^ ) logrc instances has a shadowless 
solution. Moreover a (not necessarily shadowless) solution of any instance Ijj is also a solution of I. 

Apply to each resulting instance j of the Bipedal Multicut Compression* problem the algorithm 
of Lemma l231 If the algorithm returns a solution for at least one of the instances then, as said above this is a 
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solution of the original instance /. If the algorithm returns 'NO' for all the instances, this means that no one 
of them has a shadowless solution. It follows that the original instance does not have a solution either. Taking 
into account that the algorithm of Lemma 12.51 takes time 0*(4 P ), processing of2°((P+ lo gl w l) 3 )log n instances 
takes time O* (2 t^ +log l w D 3 ) ). Consequently, the instance / of the MULTICUT COMPRESSION* problem can 
be solved in time 0*(2°^ p+lo ^ w ^). □ 

3 Making the solution shadowless 

The purpose of this section is to reduce solving MULTICUT COMPRESSION* to finding a shadowless so- 
lution. We present a randomized transformation that, given an instance having a solution, it modifies the 
instance in such a way that the new instance has a shadowless solution with probability 2~°( p '. More 
precisely: 

Lemma 3.1. Given an instance I of the MULTICUT COMPRESSION* problem, we can construct in time 
O* (2°( p } ) an instance I' with the same parameter p as I such that 

1. Any solution of I 1 is a solution of I. 

2. If I has a solution, then I' has a shadowless solution with probability 2~°( pi \ 

This means that if / has a solution, then by invoking Lemma I3TT1 2°( p3 ) times, with constant probability 
at least one of the instances has a shadowless solution. Thus if we are able to solve the problem with 
the assumption that a shadowless solution exists, then this way we can get a solution for / with constant 
probability. The main result of this section is a derandomized version of this transformation (Lemma [23]). 

The main idea in the proof of Lemma 13.11 is to try to randomly guess a set Z whose removal does not 
change the instance substantially, but makes the instance shadowless. Section l3Tl introduces the torso opera- 
tion, which is used to remove the set Z, and states what properties the set Z need to satisfy. The construction 
of Z is based on the observation that the solution can be characterized by a "closest set" and we need to 
locate the boundary of such a set (Section 13.21) . We develop a randomized algorithm for this purpose in 
Sections [3.3143.61 The algorithm uses the notion of important separators; Section [331 reviews this concept 
and shows why it is relevant for our problem. Sections I3T4143. 51 describe and analyze the randomized selec- 
tion process. Section [3T6l shows how the random selection can be derandomized to obtain the deterministic 
version (Lemma [231 ) . 

3.1 Torsos and shadowless solutions 

The randomized transformation can be conveniently described using the operation of taking the torso of a 
graph. 

Definition 3.2. Let G be a graph and C C V(G). The graph torso (G,C) has vertex set C and two vertices 
a,b G C are adjacent if{a,b} G E{G) or there is a path P in G connecting a and b whose internal vertices 
are not in C. 

In particular, every edge of G[C] is in torso (G,C). It is easy to show that this operation preserves 
separation inside C: 

Proposition 3.3. Let C C V{G) be a set of vertices in G and let a,b EC two vertices. A set S C C separates 
vertices a and b in torso(G,C) if and only ifS separates these vertices in G. 
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Proof. Let P be a path connecting a and b in G and suppose that P is disjoint from the set S. The path P 
contains vertices from C and from V(G) \ C. If «,v G C are two vertices such that every vertex of P between 
u and v is from V(G) \ C, then by definition there is an edge uv in torso (G,C). Using these edges, we can 
modify P to obtain a path P' that connects a and b in torso (G,C) and avoids 5. 

Conversely, suppose that P is a path connecting a and Z? in the graph torso (G,C) and it avoids S C C. 
If P uses an edge mv that is not present in G, then this means that there is a path connecting u and v whose 
internal vertices are not in C. Using these paths, we can modify P to obtain a path P' that uses only the edges 
of G. Since S C C, the new vertices on the path are not in S, i.e., P' avoids S as well. □ 

Let / = (G, W,T,p) be an arbitrary instance of MULTICUT COMPRESSION* . Given a set Z C V(G)\W 
of vertices, the reduced instance I/Z = (G' ,W,T' ,p) is defined the following way: 

1 . The graph G' is torso (G,V(G) \Z). 

2. For every v € V(G), let (v) = N (C) if v belongs to component C of G[Z], and let (v) = {v} if v ^ Z. 
The set T' is obtained by by replacing every pair (x, y) G T with the set of pairs {(x 1 ,y') \ x' G € 

*(y)}. 

The main observation is that if we perform this torso operation for a Z that is sufficiently large to cover 
the shadow of a hypothetical solution S and sufficiently small to be disjoint from 5, then S becomes a 
shadowless solution of I/Z. Furthermore, the torso operation is "safe" in the sense that it does not make the 
problem easier, i.e, does not create new solutions. 

Lemma 3.4. Let I = (G,T,W,p) be an instance of MULTICUT COMPRESSION* and let Z C V(G)\W be 
a set of vertices. 

(1) Every solution of I/Z is a solution of I. 

(2) If I has a solution S such that Z covers the shadow and ZDS = 0, then S is a shadowless solution of 
I/Z. 

Proof. Let G and G' = torso(G,\ / (G) \Z) be the graphs in instances / and I/Z, respectively. To prove the 
first statement, we show that if S' C V(G') is a solution of I/Z, then S' is a solution of / as well. Suppose 
that some pair (x,y) of / is not separated by S'. Let P be a path in G \ S' going from x to y. Let xf and / be 
the first and last vertex of P not in Z, respectively, and let P' be the subpath of P from xf to y' . (Note that P 
cannot be fully contained in Z, as it contains at least one vertex of W.) By the way I/Z is defined, (x' ,y') is a 
pair in I/Z, hence 5" separates x' and y' in G' = torso(G,C). Using Prop. 13.31 with C = V(G)\Z, we get that 
S' separates x' and y' in G, which is in contradiction with the existence of the path P. A similar argument 
shows that there is no path in G \ S' that connects two vertices of W. 

For the second statement, suppose that S is a solution of / with SnZ = 0. Let us show that S is a solution 
of I/Z as well. Suppose that 5 does not separate x' and / in G' for some pair (x' ,y') of I/Z. Using Prop. 13.31 
with C = V(G)\Z, we get that S' does not separate x' and y' in G, i.e., there is ax 1 — y' path P in G\S. By 
the way the pairs in I/Z were defined, there is a pair (x,y) of / and there is an x — xf path Pi such that x' 
is the only vertex of Pi not in Z, and there is ay — y' path P2 such that y' is the only vertex of P2 not in Z. 
Clearly, these paths are disjoint form S. Therefore, the concatenation of Pi, P, P% is an x — y path in G\S, 
contradicting that S is a solution of /. 

To see that S is shadowless in G' , consider a vertex v of G' \ S. As v Z is not in the shadow of the 
solution S of /, there is a path P in G \ S going from v to a vertex w£ff. Again by Prop. 13.31 this means that 
there is a v — w path in G' \ S as well, which means that v is not in the shadow of the solution S of /'. □ 
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3.2 Closest sets 



Lemma |3~l4l shows that in order to reduce the MULTICUT COMPRESSION* problem to rinding a shadowless 
solution, all we need is a set Z that covers the shadow of a hypothetical solution 5, but disjoint from S itself. 
It is not obvious how this observation is of any help: it seems that there is no way of constructing such a set 
without actually knowing solution S. Nevertheless, we present a randomized procedure that construct such 
a set with nonnegligible probability. 

The main idea of the randomized procedure is that a solution of a MULTICUT COMPRESSION* instance 
can be characterized by the set of vertices reachable from W, and we can assume that this set has the 
property that it cannot be made smaller without increasing the size of the boundary. The following definition 
formalizes this property: 

Definition 3.5. Let G be an undirected graph and let W C V(G) be a subset of vertices. We say that a set 
R^W is a W-closest set if there is no R' C R with R' 3ff and \N(R')\ < \N(R)\. 

The main technical idea of the paper is the following randomized procedure, which, in some sense, finds 
the boundary of a closest set. Note that this statement could be of independent interest, as it is about closest 
sets in general and contains nothing specific to multicut problems. 

Theorem 3.6 (random sampling). There is a randomized algorithm RandomSet(G, W,p) that, given a graph 
G, a setW C V(G), and an integer p, produces a set Z C V(G) \ W such that the following holds. For every 
closest set R with \N(R)\ < p, the probability that the following two events occur is at least 2~°( p3 \- 

1. N(R)DZ = <d, and 

2. V(G)\(RUN(R))QZ. 

That is, the two events say that Z covers every vertex outside RUN(R) and may cover some vertices 
inside R, but disjoint from N(R). To prove Theorem 13 .61 we introduce the main new technique of the paper: 
random sampling of important separators. In Section 13.31 we review the notion of important separators. 
Section [3~4l contains a simplified proof of Theorem 13.61 (with probability bound 2 2 instead of 2~°( p3 )). 
The full proof appears in Section [331 We show below that Theorem I3.6l can be used to prove Lemma |3~T1 
Section l33l shows how to derandomize Theorem l3.6l which immediately proves Lemma [231 

Proof ( of Lemma EOT). Let / = (G,W,T,p) be an instance of MULTICUT COMPRESSION*. Let us use the 
algorithm RandomSet(G, W,p) of Theorem 13.61 to obtain a set Z and let /' = I/Z. By Lemma l33fl every 
solution of /' is a solution of / as well. 

Assume now that / has a solution S; let S be a solution such that \S\ is minimum possible, and among such 
solutions the set R of vertices reachable from W in G\S is as small as possible. Clearly, N(R) C S. We claim 
that R is a W-closest set. Suppose that there is a set R' C R containing W such that |A^(i?')l ^ 1^(^)1- Let 
5' =N(R'), we have that \S'\ < \S\. We claim that S' is a solution, contradicting the minimality of 5. Suppose 
that there is a path P in G\S' connecting the two terminals in a demand (x,y) G T or two vertices of W. In 
both cases, P has to go through a vertex of W (here we use that the definition of MULTICUT COMPRESSION* 
requires that W is a multicut). Therefore, P is fully contained in R' C R, which implies that it is disjoint from 
S, i.e., S is not a solution. Thus we have proved that R is indeed a W-closest set. The same argument shows 
that N(R) is a solution, hence N(R) = S has to hold. 

As R is a W-closest set, the probability that both SDZ = and V(G) \(RUS)QZ hold is 2~°(A The 
later inclusion is equivalent to saying that the shadow of the solution S is contained in Z. Therefore, by 
Lemma l3~4l set S is a shadowless solution of instance /'. □ 
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3.3 Important separators 

The concept of important separators was introduced in ||29l to deal with the multiway cut problem. Let G 
be an undirected graph and let X,Y C V(G) be two disjoint sets. A set S C V(G) of vertices is an X — Y 
separator if S is disjoint fromXUF and there is no component 1 K of G\S with both KCiX ^ andKHY ^ 0. 
To improve readability, we write s — Y separator instead of {s} — Y separator if s is a single vertex. 

Definition 3.7. Let X,Y C V(G) be disjoint sets of vertices, S C V(G) be an X — Y separator, and let K be 
the union of every component of G\S intersecting X. We say that S is an important X — Y separator if it is 
inclusionwise minimal and there is noX — Y separator S' with \S'\ < |5| such that K' D K, where K' is the 
union of every component ofG\S' intersecting X. 

Note that the order of X and Y matters: an important X — Y separator is not necessarily an important 
Y — X separator. It is easy to see that if S is an important X — Y separator, then 5 = N(R) for some set R 
with X C R and (R U N(R) ) PI Y = 0. Furthermore, it can be assumed that every component of G [R] contains 
at least one vertex of X: otherwise, if removing a component not containing a vertex of X from R decreases 
N(R), then N(R) was not a minimal X — Y separator. In particular, if X contains only a single vertex, then 
we can assume that G[R] is connected. 

A bound on the number of important separators was given in |[29l (although the notation there is slightly 
different). A better bound is implicit in [8]. For the convenience of the reader, we give a self-contained 
proof of the following fact in the appendix. 

Lemma 3.8. Let X,Y C V(G) be disjoint sets of vertices in graph G. For every p > 0, there are at most 4 P 
important X — Y separators of size at most p. Furthermore, we can enumerate all these separators in time 
0*(4P). 

The following lemma connects closest sets and important separators by showing that the boundary of a 
closest set is formed by important separators. Intuitively, every vertex v outside the closest set R "sees" a 
part of the boundary N(R) which is an important v — W separator: otherwise, we could "push" this part of 
the boundary away from v and towards W, contradicting the assumption that R is a closest set. 

Lemma 3.9 (pushing). Let G be an undirected graph, W a set of vertices, andR a W -closest set. For every 
vertex v RL)N(R), there is an important v — W separator S v C N(R). 

Proof. Let v be an arbitrary vertex of G not in RUN(R) and let K be the component of G\N(R) containing 
v. As v $RUN(R) and W C R, we have that K is disjoint from W. We show that N(K) is an important v — W 
separator. First, we observe that N(K) is a minimal v — W separator: we have N(K) C N(R), thus every 
vertex of N(K) is adjacent to both K and R. Suppose now that there is a K 1 D K such that \N(K')\ < \N(K)\. 
We may assume that G[K'\ is connected. Let/?' :=R\(K'UN(K')). NowN(R') C (N(R)\N(K))UN(K'): it 
is clear that every neighbor of R' is in N(R) UN(K') (as it cannot be in K') and every vertex of N(K) \N(K') is 
fully contained in^'. Thus \N(R')\ < \N(R) \ follows from \N(K')\ < \N(K)\. Furthermore, the connectivity 
of G[K'} and K C K' implies that K' contains a vertex of N(K) C N(R) and therefore K'\JN(K') contains a 
vertex of R. This means that R is a proper subset of R' with \N(R')\ < \N(R)\, contradicting the assumption 
that R is a IV-closest set. □ 

3.4 Random sampling of important separators — simplified proof 

In this section, we present a simpler version of the proof of Theorem 13 .61 where the probability of success is 
double exponentially small in p. This simpler proof highlights the main idea of the randomized reduction. 
The full proof, which improves the probability to 2~°( p ) with additional ideas, appears in Section [331 

'Throughout this paper, when we refer to a component A* of a graph, we consider the set of vertices of this component. We omit 
saying "the set of vertices of" for the sake of brevity. 
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By Lemma [3^81 we can enumerate every separator of size at most p that is an important v — W separator 
for some v. We will call such sets "impseps" for short. 

Definition 3.10. A set S C V(G) \ W is an impset if S an important v — W separator of size at most p for 
some vertex v G V(G) \ (W U S). 

By Lemma I3T81 there are at most 4^ • |V(G)| impseps and we can enumerate them in time 0*(4 P ). 

Recall that the shadow of a set S is the set of vertices not reachable from W in G \ S. By Lemma I3T91 
every vertex of the shadow of N(R) is covered by the shadows of impseps contained in N(R), i.e., by at 
most 2 P impseps. This suggests that we may construct a set Z satisfying the conditions of Theorem 13 .61 by 
guessing these impseps and obtaining Z as the union of the shadows of the selected impseps. However, in 
general the number of impseps cannot be bounded as a function of p only: as some impseps may not be 
contained in N(R), their number can depend on |V(G)| as well. Thus complete enumeration of all possible 
subset of 2 P impseps is not feasible. Instead, we randomly select a subset of impseps and hope that it 
contains these at most 2 P impseps and it does not contain any impsep whose shadow intersects N(R). 

The probability of randomly selecting an impsep should not be too high, because we want to avoid 
selecting any impsep whose shadow contains a vertex of N(R). We need a bound on the number such 
impseps. Intuitively, the bound of Lemma l3T8l on the number of important separators should imply that each 
vertex of N(R) is contained in the shadow of a bounded number of impseps, but in order to make this claim 
precise, we need to consider a slightly different notion of shadow: 

Definition 3.11. The exact shadow of a set S C V(G) \ W contains those vertices v G V(G) \(W US) for 
which S is a minimal v — W separator. 

The following lemma is true only for exact shadows: the bound in (2) is not true with the original 
definition of shadow. 

Lemma 3.12. (I) IfS is an impsep, then v G V(G) \ (W U S) is in the exact shadow ofS if and only ifS is 
an important v — W separator. 
(2) Each vertex v G V(G) \ W is contained in the exact shadow of at most \ p impseps. 

Proof. (1) By definition, if S is an important v — W separator, then S is a minimal v — W separator, hence v is 
in the exact shadow of S. For the other direction, suppose that v is in the exact shadow of and impsep S. As 
S is an impsep, there is a vertex u G V(G) \(WUS) such that S is an important u — W separator. If S is not 
an important v — W separator, then (since the definition of exact shadow implies that S is a minimal v — W 
separator) there is a v — W separator 5' with \S'\ < \S\ and such that a larger set of vertices are reachable 
from v in G \ S' than in G \ S. We claim that S' is a u — W separator as well. Suppose that there is a u — W 
path P in G \ S'. This path has to go through S \ S'. Since S is a minimal v — W separator, S has a neighbor 
reachable from v in G \ S and hence in G \ S'. Therefore, u and v are in the same component of G \ S', 
contradicting the assumption that 5' is a v — W separator. Next we show that every vertex r reachable from 
u in G \ S is reachable from u in G\ S'. Let P be an u — r path in G \ S and suppose that it contains a vertex 
q G 5" \ S. As S' is a minimal v — W separator, there is a q — W path Q in G \ S'. The concatenation of the 
prefix of P ending at q and Q is a u — W walk, hence Q has to contain a vertex q' G 5. By the definition 
of <2, this vertex q' has to be in S \ S' and hence it is reachable from v in G\S'. However, the subpath of 
Q from q' to W does not contain any vertex of 5'. This shows that every vertex reachable from u in G\S 
remains reachable in G \ S', contradicting the assumption that S is an important u — W separator. Therefore, 
S is indeed a important v — W separator. 

(2) By Lemma [3T8l there are at most 4 P important v — W separators of size at most p, thus by (1), vertex 
v can be contained in the exact shadows of at most that many impseps. □ 

Combining Lemmas I3T91 and l3T2l we immediately have: 
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Proposition 3.13. LetR be a W '-closest set and let S = N(R). Then every vertex v ^ RUN(R) is in the exact 
shadow of an impsep S v C S. 

We use Prop. |3.13| to bound the probability that the constructed set Z satisfies the second condition of 
Theorem 13.61 We need the following simple observation to argue that the selection of these sets do not 
interfere with the first condition of Theorem [331 

Lemma 3.14. Let R be a W -closest set and let S = N(R) and S' C S. Then the shadow ofS' is disjoint from 



Proof. Suppose that v G S is in the shadow of S' C S, i.e., v S' and S' is a v — W separator. As v G 
N(R), vertex v has a neighbor r G R. We can assume that every component of G[R] contains a vertex 
of W: otherwise removing a component disjoint from W strictly decreases R without increasing |iV(7?)|, 
contradicting the assumption that R is a IV-closest set. This means that there is a path from r to W fully 
contained in R. It follows that there is a path from v to W fully contained in RU {v}, which is disjoint from 
S', contradicting the assumption that v is in the shadow of S'. □ 

In the simplified proof of Theorem I3.6L we select impseps uniformly at random and take the union of 
their exact shadows. In light of Lemmas 13.91 and |3. 12[ there is a set of at most 2 P impseps that have to be 
selected and there is a set of at most N(R) • 4 P impseps that have to avoided in order for the random selection 
to be successful. 

Simplified proof of Theorem U^6\ The algorithm RandomSet(G, W,p) first enumerates every impsep of size 
at most p; let X be the collection of all these sets. By Lemma [3781 the size of X is 0*{4 P ) and can be 
constructed in time 0*(4 P ). Let X' be the subset of X where each element from X occurs with probability 
4 independently at random. Let Z be the union of the exact shadows of every set in X' . We claim that the 
set Z satisfies the requirement of the theorem. 

Let R be a VK-closest set and let S = N(R). LetX\,X2, ■ . ■ ,X<i G X be the impseps that are fully contained 
in S. As \S\ < p, we have d < 2 P . By Lemma [3. 141 we have that the exact shadow of Xj is disjoint with S 
for every j G [d]. Now consider the following events: 

(El) zns = 

(E2) the exact shadow of Xj is a subset of Z for every j G [d] . 

Note that by Prop. 13.131 event (E2) implies that the shadow of S is fully contained in Z, i.e., V(G) \(RU 
N(R)) C Z. Our goal is to show that with probability 2 2 ° {p) , events (El) and (E2) both occur. 

Let A = {X\,X2, ■ ■ ■ ,Xd} and let B contain those sets in X whose exact shadows intersect S. By 
Lemma |3.12[ each vertex of S is contained in at most 4 P exact shadows of impseps. Thus |S| < \S\ • 4 P < 
p-4 p . If no impsep from from B is selected, then event (El) occurs. If every impsep from A is selected, 
then event (E2) occurs. Thus the probability that both (El) and (E2) occur is bounded from below by the 
probability of the event that every element from A is selected and no element from B is selected. Note that A 
and B are disjoint: A contains only sets whose exact shadows are disjoint from S, while B contains only sets 
whose exact shadows intersect S. Therefore, the two events are independent and the probability that both 
events occur is at least 



S. 




□ 
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3.5 Random sampling of important separators — full proof 

In order to optimize the success probability, we perform the randomized selection of important components 
in two phases: first we select some impseps and add new edges to the graph and in the second phase 
we restrict our attention to impseps that induce cliques in the modified graph. We observe that important 
separators that induce cliques are nested, hence we can get a bound of p instead of 4 P for the number of 
such separators. 

Lemma 3.15. For every vertex v G V(G)\W, there are at most p important v — W separators of size at most 
p inducing a clique. 

Proof. Every minimal v — W separator arises as N(X) for some set X with v£X and G[X] connected. The 
bound follows from observing that important separators inducing cliques are nested. That is, we show that 
if X\ and X2 are connected sets containing v such that N(X\) and N(X2) are important v — W separators 
inducing cliques, then either X\ C X 2 or X 2 C X\. 

Suppose that Xi \X 2 and X 2 \Xi are both nonempty. If X\ \ X 2 ^ and X\ is connected, then there is 
a vertex x\ G X\ C\N(X 2 ). As N(X 2 ) is a clique, every vertex of N(X 2 ) is adjacent with #1, implying that 
N(X 2 ) CXiUiV(Xi). IfX 2 \Xi / 0, then a symmetrical argument shows thatN(Xi) CX 2 UiV(X 2 ). Weclaim 
thatAr(XiUX 2 ) QN(Xi)nN(X 2 ) and hence |N(XiUX 2 )| < \N(Xi)\,\N(X 2 )\; asXj UX 2 DX U X 2 , this would 
contradict the assumption that Xi and X 2 are important components. Consider a vertex u G N(X\ UX 2 ), 
which must have a neighbor w G Xi UX 2 . If w G Xi PlX 2 , then u G N(X\) n N(X 2 ) and we are done. Suppose 
without loss of generality that w G Xi \X 2 . Then u G N(Xi) C X 2 U N(X 2 ), but u X 2 by definition, hence u 
has to be in N(X 2 ) as well. 

Suppose now that Xi, X 2 , X t are connected sets containing v such that N(X\), N(X 2 ), N(X t ) 
are important v — W separators inducing cliques. We have shown that the X,'s form a chain, i.e., we can 
assume without loss of generality that Xi C X 2 C • • • C X t . This means that there are at most p of them, as 
the definition of important separator implies that |Af(Xi)| > \N(X 2 )\ > • • • > \N(X t ) \ has to hold. □ 

By Lemma [3.12f 1). we have the following the bound: 

Lemma 3.16. Every vertex v G V(G) \ W is contained in the exact shadow of at most p impsep X such that 
G[N(X)] is a clique. 

Full proof of Theorem \JM The randomized algorithm consists of two phases. For consistency of notation, 
let G\ = G. In the first phase, we select a set of impseps and obtain G 2 for G\ by making the selected 
impseps cliques. In the second phase, we select a set of impseps inducing cliques in G 2 and obtain Z as the 
union of the exact shadows of the selected impseps. 

Phase 1. Let X\ be the set of all impseps. In the first phase, we select a subset X[ C X\ by putting every 
set of X\ into X[ with probability p\ = 4~ p independently at random. Then we make every set X G X[ a 
clique; let G 2 be the graph obtained this way. 

Let 7? be a W-closest set and let S = N(R). By Lemma 13.131 there exists a collection A 2 of impseps, all 
subsets of S, such that V(G) \(RUS) is covered by the exact shadows of these impseps. Let us estimate the 
probability that the events 

(El) Every S' G A 2 induces a clique in G 2 . 

(E2) Every S' G A 2 has the same exact shadow in G\ and in G 2 . 

(E3) Every S' G A 2 is an impsep in G 2 . 

occur. 

Let us make a subset A\ of A 2 such that for every S 2 G A 2 and x,y G S 2 , there is a set S\ G A\ with 
x,y G S\. In other words, the sets in A\ cover every pair {x,y} of vertices covered by the sets in A 2 . Since 
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there are f'?') < (5) such pairs, it is clear that there exists a collection Ay of size at most (£). Observe that, 
by Lemma 13.141 the shadow of every set in A y is disjoint from S. 

Let By contain those impseps whose exact shadows intersect S; by Lemma l3.12l we have |fii | < |5| -4 P < 
p • 4 P . We claim that if every member of A\ is in X[ and no member of By is in X[, then (E1-E3) occur. 

Consider an 5" G Ai. Assuming that every member of A\ is in X[, the set G2IS'] becomes a clique. This 
shows (El). 

To show (E2), that is, that 5" has the same exact shadows in Gy and G2, we show that a subset 5" C S is 
a v — W separator for some vertex v in Gy if and only if it is in G2. This shows that S' is a minimal v — W 
separator in Gy if and only if it is in G2, implying the equalities of the exact shadows. One direction is clear, 
as Gi is a subset of G2. For the other direction, suppose that S' is not a v — W separator in G2. Let K be 
the connected component of v in Gy \S'; by assumption K is disjoint from W. Then there have to be two 
vertices a G K and b $ KUS' that are adjacent in G2 but not in G\. The reason why a and ft are adjacent in 
G2 is that there is some X G X[ with a, ft el. As we assumed that no member of By is in this means that 
the exact shadow of X is disjoint from S (and hence from 5")- As X is an impsep, it is an important (hence 
minimal) q — W separator for some vertex q in its exact shadow. This means that there are paths from q to a 
and ft in the exact shadow of X. Therefore, there is a path P from a to ft in G\ whose internal vertices are in 
the exact shadow of X, hence disjoint from S'. It follows that ft is also in the component K of v in Gy \ S', a 
contradiction. 

Finally, to show (E3), we have to show that every 5' G A2 remains an impsep in G2. As 5' is an impsep, it 
is an important v — W separator for some vertex v. Again, let K be the connected component of v in Gy \ S'. 
By the previous paragraph, 5' is a K — W separator in G2. This implies that S' is an important v — W separator 
in G2 as well: if there is a separator 5" contradicting that 5' is an important v — W separator in G2, then S" 
is a v — W separator in Gy as well (as Gy is a subgraph of G2) and at least one vertex of S' is reachable from 
v in Gy \ S", which means that S" contradicts that S' is an important v — W separator in Gy . 

We can conclude that the probability that (E1-E3) occur can be bounded from below by the probability 
of the event that every set in Ay is selected and no set from By is selected. As the sets Ay and By are disjoint 
(recall that the exact shadow of every member of Ay is disjoint from S by Lemma 13.141 while the exact 
shadow of every member of By intersects S by definition), this probability is at least 

(1 _4~P)/'- 4 '' . (4-py 1 > e - 2 P .4-P 3 = 2~°(p 3 ) 

(in the inequality, we use that 1 +x > exp(x/(l +x)) for every x > — 1 and 1 — 4~ p > 1 /2). 

Phase 2. Let Xi be the collection of all sets 5' that are impseps in G2 and Gi\S'\ is a clique (i.e., we are 
collecting sets that are important v — W separators in G2 for some v). Let X' 2 be a subset of X2 where every 
X G X2 appears with probability p% = 1 — 2~ p independently at random. Let Z be the union of the exact 
shadows of the sets in X' 2 . 

If (E1-E3) occur, then every set in A2 is in X2. If additionally the events 

(E4) ZnS = 0, and 
(E5) A 2 C X' 2 

occur, then every v RUN(R) is in the exact shadow of some 5' G X^ and v G Z follows. 

Let us estimate the probability that both (E4) and (E5) hold on condition that (E1-E3) hold. Let B2 con- 
tain those members of X2 whose exact shadow in G2 intersects S; we have \B2\ < p\S\ < p (by Lemma l3.161 
every vertex of S is contained in the exact shadow of at most p impseps inducing cliques). If no member of 
B2 is selected, then no exact shadow of a set of X' 2 contains a vertex of S, and hence Z n S = 0. Note that 
A2 and B2 are disjoint: by (E2), every S' G A2 has the same exact shadow in Gy and G2, therefore the exact 
shadow of S' G A2 is disjoint from S in G2 as well. Therefore, the probability that (E4) and (E5) hold can be 
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bounded from below by the probability of the event that every member of A 2 is selected and no member of 
B2 is selected, which is at least 

(2-py 2 . (\-2-Pf > 2~ p3 -e- 2 2-°^ 

(again, we use that 1 +x > exp(x/ (1 +x)) for every x > — 1 and 1 — 2~ p > 1 /2). 

Taking into account the probability of success in both phases, we get that for each VK-closest set /?, the 
set Z satisfies the requirements with probability 2~°( p \ □ 

3.6 Derandomization 

By running 2°^ times the algorithm of Lemma |3~T1 we get a collection of instances such that at least 
one of them satisfy the requirements of Lemma 12.31 with arbitrary large constant probability. To obtain 
a deterministic version of Lemma 12.31 we derandomize the algorithm of Theorem 13.61 using the standard 
technique of splitters. 

Lemma 3.17. There is an algorithm DeterministicSets(G,VK,p) that, given a graph G, asetW C V(G), and 
an integer p, produces t = 2 QlyP Hog |V(G)| subsets Z\, . .., Z t ofV(G) \ W such that the following holds. 
For every closest set R with \N(R) \ < p, there is at least one 1 < i < t with 

1. N(R)nZi = <b, and 

2. V(G)\(RUN(R))QZi. 

Proof. An (n,r,r 2 )-splitter is a family of functions from [n] to [r 2 ] such that for any subset X C [n] with 
\X\ = r, one of the functions in the family is injective on X. Naor, Schulman, and Srinivasan [32] gave an 
explicit construction of an (n,r, r 2 )-splitter of size 0(r 6 logrlog?i). 

Observe that in the first phase of the algorithm of Theorem 13.61 a random subset of a universe X\ of 
size n\ = \X\\ <4 p -n is selected, where n = \V(G)\. There is a collection A\ C X\ of a\ < p 2 sets and a 
collection B\ C X\ of b\ < p -4 P sets such that if every set in A\ is selected and no set in B\ is selected, then 
(E1-E3) hold. Instead of selecting a random subset, we try every function / in an (n\,a\ + b\, (a\ +b\) 2 )- 
splitter family and every subset F C [{a\ + b\) 2 ] of size a x (there are ( (ai+ J' l) ) = 2°^) such sets F). For 
a particular choice of / and F, we select those sets X G X\ for which f(X) S F. By the definition of the 
splitter, there will be a function / that is injective on A\ Ufii, and there is a subset F such that f(X) G F for 
every A\ and f(X) F for every B\. For such an / and F, the selection will ensure that (E1-E3) hold. 

In the second phase, we select a random subset of universe X2 of size n-i < pn, and there is a collection 
A2 C X2 of size 02 < 2 P and a collection B2 C X2 of size b2 < p 2 such that if every set in A2 is selected and 
no set in B2 is selected, then (E4) and (E5) hold. As in the first phase, we can replace this random choice 
by enumerating the functions of an («2,«2 + ^2 ; (#2 + ^2) 2 )-splitter and every subset F C [(02 + ^2) ] of slze 
b2 (there are (^ a2 ^ b2 ^ \ = 2°^ such sets F). This time, we select a set X G X2 if f(X) is not in F and it is 
clear that there is an / and F for which (E4) and (E5) hold. 

Let us bound the number of branches of the algorithm. In both phases, the size of the splitter family 
is 2°( p ' Togra and the there are 2°( p ' possible F . (Note that the splitter family can be constructed in time 
polynomial in the size of the family.) Thus the algorithm produces 2°( p ' ■ logn sets. □ 

4 Reduction to the bipedal case 

Let (G,T,W,p) be an instance of the MULTICUT COMPRESSION* problem. Let us call a component of 
G\W having at least two legs a non-trivial component of G w.r.t. W (when the context is clear, we will just 
refer to a non-trivial component). As the solution of MULTICUT COMPRESSION* has to be a set S that is 
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disjoint from W and a multiway cut of W, the number of non-trivial components is a lower bound on the 
size of the solution. 

We present an algorithm that either solves the given instance of the MULTICUT COMPRESSION* prob- 
lem or produces a set of instances of the Bipedal Multicut Compression* problem whose number 
is bounded by a function of p and such that the considered instance of the MULTICUT COMPRESSION* 
problem has a shadowless solution if and only if of the output instances of the BIPEDAL MULTICUT 
COMPRESSION* problem has a solution. In addition, any (not necessarily shadowless) solution of any of 
these output instances is a solution of the input instance of the MULTICUT COMPRESSION* problem. The 
key ingredient of this algorithm is a procedure that, given an instance of the MULTICUT COMPRESSION* 
problem where at least one component has more than 2 legs, reduces this instance to a set of instances whose 
number is bounded by a function of p and such that in each instance either the parameter is decreased or the 
number of non-trivial components is increased. 

The main idea for the branching is the following. Let B be a set of vertices in G \ W and let S be a 
hypothetical shadowless solution for MULTICUT COMPRESSION*. We try to guess what happens to each 
vertex of B in the solution S. It is possible that a vertex v G B is in S; in this case, we delete v from the instance 
and reduce the parameter. Otherwise, as the solution is shadowless, v has to be in the same component as 
precisely one w G W (since S is a multiway cut of W). In this case, identifying v and w does not change the 
solution. 

The following lemma formalizes these observations. Given a set B of vertices in G \ W and a function 
/ : B — > W, we denote by Gf the graph obtained by replacing each set {w} U/ _1 (w) with a single vertex 
(with removal of loops and multiple occurrences of edges). To simplify the presentation, we will assume 
that this new vertex is also named w. We denote by T f the set of terminal pairs where each vertex v G B is 
replaced by f(v), and we denote by T \ v the set where every pair involving the vertex v is removed. 

Lemma 4.1. Let K be a non-trivial component ofG\W with set of legs W and let B CK. Then (G, T, W, p) 
has a shadowless solution if and only if one of the following statements is true. 

• There is av G B such that the instance {G \ v, T \ v, W, p — 1 ) has a shadowless solution. 

• There is afunction f : B — > W such that instance (Gf,Tf,W,p) has a shadowless solution. 

Proof. For the "if" part, we observe that existence of a solution for any of the above instances means 
existence of a solution for (G,T,W,p). This is certainly true in the first case, where we delete a vertex and 
decrease the parameter by 1. In the second case, the statement follows from the fact that replacing G with 
Gf by identifying vertices cannot make the problem any easier. 

For the "only if" part, we observe that if (G,T,W,p) has a shadowless solution then one of the instances 
specified in the statement of the lemma has a solution as well. Indeed, let S be a shadowless solution of 
(G,T,W,p). Then it either intersects or does not intersect with B. In the former case, we can specify a 
v G SHB such that B \ { v} is a solution of (G \ v, T \ v, W, p — 1 ) . In the latter case, we can assign each v G B 
precisely one vertex f(v) of W such that vertex v belongs to the same component G \ S as f(v). It is not hard 
to see that S is a shadowless solution of (G/,Tf,W,p). □ 

Lemma 14.11 determines a set of recursive calls to be applied in order to find a shadowless solution for 
the given instance (G,T,W,p) of the MULTICUT COMPRESSION* problem. It is clear that in each step, the 
number of directions we branch into is bounded by a function of p, \B\, and \ W\ (observe that the number of 
functions / : B — > W can be bounded by |Wf B l). However, in order to ensure that the size of the search tree is 
bounded, we need to ensure that the height of the search tree is bounded as well. This is obvious for the first 
type of branches, as p decreases. The following property ensures that in every branch of the second type, 
either the number of nontrivial components increases or we get an instance that trivially has no solution. 



17 



Definition 4.2. Let K be a non-trivial component and letW^W be its set of legs. Let B be a subset of 
K. We say that B is a shattering set if for any function f : B —}W one of the following statements is true 
regarding the instance (Gf,Tf,W,p) of the Multicut COMPRESSION*. 

• There is aw such that there is now — (W \ {w}) separator of size at most p in in Gf[KU W]. 

• The number of non-trivial components ofGf\ W is greater than the number of non-trivial components 
ofG\W. 

Note that the first possibility includes the case when G/[W] is not an independent set (recall that an X — Y 
separator is disjoint from XUY by definition). In Section |4~T1 we present a polynomial-time algorithm for 
finding a shattering set. 

Lemma 4.3. Given an instance (G,T,W,p) of the MULTICUT COMPRESSION* problem and a component 
K of G\W with more than two legs, we can find a shattering set B C K of size at most 3p in polynomial 
time. 

With Lemma FOI in mind, we are ready to prove Lemma [2~4l the main statement of this section. 

Proof (of Lemma \Z4\) . The desired algorithm looks as follows. If the given instance (G,T,W,p) satisfies 
one of the following cases, then we can determine the answer without any further branching: 

• All the terminal pairs of T are separated: this is a "YES" instance. 

• The parameter is zero while there are unseparated terminals: this is a "NO" instance. 

• There is a w £ W such that there is no w — (W \ {w}) separator of size at most p in G: this is a "NO" 
instance. The situation where W is not an independent set is a special subcase of this case. 

• The number of non-trivial components is greater than p: this is a "NO" instance since each non-trivial 
component contributes at least one vertex to any solution. 

• Every component has at most two legs: this is an instance of Bipedal Multicut Compression* 
problem and hence it is returned as the output. 

Otherwise, we choose a component K of G \ W having more than two legs, and use Lemma R31 to compute 
a shattering subset B of K of size at most 3 p. We apply recursively the branches specified in the statement 
of Lemma |4~T1 If the "YES" answer is obtained on at least one of these branches, then we return "YES". 
Otherwise, we return "NO". According to Lemma |4~TI the resulting answer is correct. Furthermore, assume 
that no one of branches produces a "YES" or "NO" answer. Then, according to Lemma 14.11 the parent 
instance has a shadowless solution if and only if the instance on one of the branches has a shadowless 
solution. It is also not hard to notice that any solution for a branch instance can be easily transformed into a 
solution of the parent instance. Applying this argument inductively, we conclude that the same relationship 
exists between the original instance (G,T,W,p) and the Bipedal Multicut Compression* problem 
instances at the leaves of the recursion tree. 

To bound the number of leaves of the recursion tree, let us define K to be the number of nontrivial 
components. Observe that removing a vertex V(G) \ W from G can decrease the number of nontrivial 
components only by at most one. Thus inspection of Lemma |4~T1 shows that the measure 2p — K strictly 
decreases in each branch. This means that the height of the search tree is at most 2p. The number of 
branches in each step can be bounded by 3p + |W| 3/ '. Thus the number of leaves of the recursion tree can be 
generously bounded by 2°^ p+log ^ w ^ . Taking into account that the runtime per node of the recursion tree is 
polynomial, it follows that the runtime of this algorithm is O* (2°^ p+log ^ w ^ >). □ 
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4.1 Finding a shattering set 

We try to find a shattering set by selecting a set that separates one leg from all the other. If it is not a 
shattering set, then we can characterize quite well how it can look like, and where should we continue our 
search for a shattering set. Let us start with two simple lemmas. 

Lemma 4.4. Let K be a non-trivial component with a set W of at least 3 legs. If G[M{\ and G[M2] are 
both connected for two disjoint sets M[,M2 Q K, then at most one ofM\ and M2 can be a multiway cut of 
(G[KUW],W). 

Proof. Assume the opposite. Since no two vertices of W belong to the same component of G[KU W] \M\ 
and \W\ > 3, we can specify two vertices w' and w" of W whose respective components C and C" in 
G[K U W] \ M\ are disjoint from the connected set M2. Then there is a w' — w" path in G[K U W] that uses 
vertices from C", then vertices from (the connected set) M\, then vertices from C". This path is disjoint from 
M2, contradicting the assumption that M2 is a multiway cut. □ 

Lemma 4.5. Let K be a non-trivial component with a set W of at least 3 legs. Let B CK be a non-shattering 
set. Then there is exactly one connected component of G[K\B] which is a multiway cut of(G[KL)W],W). 

Proof. Let / : B — > W be the mapping witnessing that B is not a shattering set. Let K' C K \ B be the 
unique non-trivial component of Gf\W which is a subset of K. It is easy to see that K' a component of 
G[K\B] as well. Furthermore, K' is a multiway cut of (G[KUW],W). Otherwise, a path between vertices 
of W in G[K(JW]\K' would correspond to a walk of Gf between the same vertices which belong to a 
non-trivial component that is a subset of K but different from K', in contradiction to the definition of /. 
Finally, Lemma l4~4l implies that K' is the unique connected component of G[K\B] being a multiway cut of 
G[KUW]. □ 

Let K be a non-trivial component with a set of legs W. Let M C K be a multiway cut of (G[K U W] ,W). 
We call N(M) (i.e the open neighborhood of M) the boundary of M (which possible includes vertices of W). 
For each w € W, the image I(w) of w is the set of vertices of N{M) reachable from w in G[K UW]\M (the 
image may include vertex w itself), see . Note that I(w) is nonempty for any w£ff: consider the first vertex 
of N(M) on a path from w to some other leg in W. Furthermore, as M is a multiway cut, the sets I(w') and 
I{w") are disjoint for w' 7^ w": otherwise, there would be a w' — w" path disjoint from M. For XCW,we let 
= Uwexl( w )- Let us select a distinguished leg w* G W. We say that M is good if all of the following 
conditions are true. 

• G[M] is connected. 

• N{M) = I(W) or, in other words, each vertex of N(M) is reachable in G[^U W] \M from some vertex 
of W. 

• |/(w*)| < P and \I(W \ {w*})\ < p holds (and hence we have \N(M)\ < 2p). 

Our goal is to obtain a shattering set from the boundary of a good multiway cut. The following lemma 
shows that either we can do this, or find a smaller good multiway cut. 

Lemma 4.6. Let K be a non-trivial component with a set W of at least 3 legs and a distinguished leg w*. Let 
M be a good multiway cut of (G[KUW],W). Then there is a polynomial-time algorithm that either returns 
a shattering set of size at most 3p or a good multiway cut M 1 C M. 

Proof. The desired algorithm first computes a smallest I(w*) —I(W \ {w*}) separator S of G[N{M) DM] 
(recall that the images are nonempty). Observe that 5 is an inclusionwise minimal w* — W \ {w*} separator 
in G[KU W] (and hence nonempty). We consider three cases: 
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Figure 2: M is a multiway cut of the 4 legs {1,2,3,4}. The dark region represents the boundary of M. 
Observe that 7({1,2,3,4}) is a proper subset of the boundary: vertices of the boundary that are adjacent 
only to C and C" are not in I(w) for any w € {1,2,3,4}. 

1. If \S\ > p, then the algorithm returns B := N(M) \ W reporting it as a shattering set. 

2. If \S\ < p and there is a unique connected component M' of G[K\ (N(M)\JS)] which is a multiway 
cut of (G[KU W], W), then the algorithm returns M 1 reporting it as a good multiway cut. 

3. If \S\ < p and there is no such unique M', then the algorithm returns B := (N(M)(JS)\W reporting it 
as a shattering set. 

This algorithm clearly takes polynomial time. The remaining proof establishes correctness of the algorithm 
in each of these three cases. 

Case 1. We prove that B := N(M) \ W is a shattering set. Otherwise, let / : B — > W be a function 
witnessing that B it is not a shattering set. It is not hard to see that M is a connected component in Gf \ W 
whose set of legs is a subset of W. We consider three subcases and arrive to a contradiction in each of them. 

Case la. M is a trivial component of Gf\W. Let w be the only leg of M. Let w\ and W2 be other 
two distinct legs of K in G that are different from w. It follows that / maps every vertex of I{w\) U/(w2) 
to w implying that there is a w — w\ and w — W2 path in Gf whose internal vertices belong to two different 
components adjacent to w\ and W2 in G[^Uff] \M. Thus Gf has at least two non-trivial components that 
are subsets of K, in contradiction to the choice of /. 

Case lb. M is a nontrivial component of Gf \ W and f(v) = w for every v 6 I(w) and w 6 W (i.e., 
each vertex on the boundary is mapped to its preimage). As the smallest I(w*) — I(W \ {w*}) separator 
in G[N(M) UM] is larger than p, G[Ml)W] does not have a w* — W \ {w*} separator of size at most p in 
contradiction to / being a witnessing function. 

Case lc. M is a nontrivial component of Gf \ W and there are distinct w\ , W2 € W such that f(v) = W2 for 
some v G I{w\). By definition of I{w\), there is a w\ — v path in G whose internal vertices are fully contained 
in K\M. Therefore, there is a w\ — W2 path in Gf whose internal vertices are disjoint from M, implying that 
Gf has a nontrivial component which is a subset of K, but distinct from the nontrivial component M. Thus 
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Figure 3: The two possibilities in Case 2 of Lemma l4~6l (the set of legs is W = {w*, 1,2,3}): either (a) 
N{M') <ZI M ( W *)US or (b) N(M') C I M (W \ {w*}) US holds. 



the number of nontrivial components increases, a contradiction. 

Case 2. We show that M'cM and M' is a good multiway cut in this case. Let us show M'cM first. 
Clearly, M' ^ M, as M' is disjoint from the (nonempty) set S C M. Thus M' ^ M is only possible if M' 
is disjoint from M, but Lemma |4~4l implies that the two disjoint connected sets M and M' cannot be both 
multiway cuts. 

For clarity, from now on we use Im{w) and 7m'( w ) f° r tne image of w on the boundary of M and M', 
respectively. Observe that Im{w) C\N{M') C Im*{w) for every w£ff: for every v G nN(M'), there is a 
w — v path disjoint from M, which is obviously disjoint from M'CMas well. This immediately implies that 
either Im(w*) or Im(W \ {w*}) is disjoint from N{M')\ otherwise, a vertex vi € Im{w*) C\N{M') and a vertex 
^2 £ /af (W \ {w*}) f~)N(M') can be connected by a path P whose internal vertices are in M' (hence disjoint 
from S), and path P can be extended to a w* — W \ {w*} path disjoint from S, contradicting the definition 
of S. Therefore, either N(M') C l M {w*) U S or N(M') C 7 M (# \ {w*}) U 5 holds. The two possibilities are 
demonstrated in Figure [3] 

To show that |/m'( w *)I an d \Im'{W \ {w*})\ are both at most p, we argue as follows. Suppose first that 
N(M') C I M (w*)US. We show that 7 M ,(W \ {w*}) = SDN(M') and therefore I M >(w*) = I M (w*) DN(M'), 
proving the bound on both |4f'(w*)| and \Im'{W \ {w*})\. Note that this implies furthermore that N{M') = 
I M /(W), i.e., every vertex of N(M') is the image of some leg. From Im{w*) nN(M') C Im*{w*) we already 
know that I M ,{W \ {w*}) C Sf\N{M'). To show that SC\N{M') C Z M ,(W \ {w*}) holds, consider a vertex 
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s eSD N(M'). By the minimality of S, for some w G W \ {w*} there is a w* - w path P in G[K U W] that 
intersects 5 exactly in s. As M' is a multiway cut, path P has to go through M'. Furthermore, we can assume 
that P contains exactly 2 vertices of N(M'). One of these two vertices is s, and the other has to be a vertex 
v 6 Im(w*) (as P contains only one vertex of S). It follows that P has a subpath Pi connecting W and v, and 
a subpath P2 connecting W and s, both of them disjoint from M'. As v G 7m(w*), path Pi connects w* and v, 
thus path P2 connects 5 and w. Therefore, s is in Im 1 (W \ {w*}), what we had to show. 

A symmetrical argument (exchanging the role of w* and W \ {w*}) shows that if N(M') C Im(W \ 
{w*})US, then/ M ,(w*) =SDN(M'), implying the bounds on \I M >(w*)\ and \I M >{W\{w*})\. Thus in both 
cases, we proved that M' C M is a good multiway cut. 

Case 3. Assume now that the algorithm returns B := (SUN(M)) \ W as a shattering set. This happens 
because the number of components of G[K\ (N(M) US)] which are multiway cuts of (G[KU W],W) is not 
exactly one. According to Lemma 1431 N{M) U S is indeed a shattering set in this case. Clearly, its size is at 
most 3 p. □ 

Lemma 1431 follows by iterative application of Lemma 1431 

Proof ( of Lemma It is not hard to see that K is a good multiway cut of (G[KUW],W). Let Mq = K. 
Apply the algorithm of Lemma 1431 to Mq. The algorithm either returns a shattering set of size at most 3p or 
a good multiway cut M\ C Mq. In the former case, we return the shattering set, in the latter case, apply the 
algorithm of Lemma l431 to Mi . Continuing this way, we obtain a sequence Mq D Mi D ... of good multiway 
cuts of decreasing size. It follows that after at most iterative applications of the algorithm of Lemma 

14.61 a shattering set of size at most 3p will be returned. □ 



5 Finding a shadowless solution by reduction to Almost 2SAT 

The goal of this section is to show that we can solve Bipedal Multicut Compression* if we know that 
there is at least one shadowless solution. 

Let x\, . . . , x n be a set of variables; a literal is either a variable x,- or its negation x,. Recall that a 2CNF 
formula is a conjunction of clauses with at most two literals in each clause, e.g., (x\ VX2) A (I3) A (xi VX4). 
The classical 2SAT problem asks if a given 2CNF formula has a satisfying assignment. It is well-known that 
a satisfying assignment for a 2CNF formula can be found in linear time (if exists). However, it is NP-hard 
to find an assignment that maximizes the number of satisfied clauses, or equivalently, to find a minimum set 
of clauses whose removal makes the formula satisfiable. Cygan et al. 11121 (improving earlier work Il36ll35l0 
gave an O* (4 k ) time algorithm for the problem of deciding if a 2CNF formula can be made satisfiable by the 
deletion of at most k clauses; they call this problem ALMOST 2SAT. We need a variant of the result here, 
where instead of deleting at most k clauses, we are allowed to delete at most k variables. An easy reduction 
(see Appendix 151 gives an algorithm for this variant. If is a 2CNF formula and X is a set of variables, then 
we denote by \ X the formula obtained by removing every clause containing a literal of a variable in X. 

Theorem 5.1. Given a 2CNF formula and an integer k, in time 0*(4 k ) we can either find a set X of at 
most k variables such that (j)\X is satisfiable, or correctly state that no such set X exists. 

It is not difficult to reduce finding a shadowless solution to the problem solved by Theorem l5.ll For each 
vertex v of G\ W, we introduce a variable whose value expresses which leg of the component containing v 
is reachable from v. This formulation cannot express that a vertex is separated from both legs. However, as 
we assume that there is a shadowless solution, we do not have to worry about such vertices. 

Proof ( of Lemma |231). We encode the Bipedal Multicut Compression* instance / = (G,T,W,p) as a 
2CNF formula (j) the following way. For each component C of G \ W having two legs, let £q(C) and i\ (C) 
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be the two legs. If component C has only one leg, then let £q(C) be this leg, and let £\{C) be undefined. 
For every vertex v G C, let £q(v) = £q(C) and £\{v) = £\{C). We construct a formula whose variables 
correspond to V(G) \ W. The intended meaning of the variables is that v has value b G {0, 1} if v is in 
the same component as £b(v) after removing the solution. To enforce this interpretation, contains the 
following clauses: 

• Group 1: (u — > v), (v — > u) for every adjacent u,v G V(G)\W. 

• Group 2: If u is a neighbor of £b(u) for some Z? G {0, 1}, then there is a clause (u = b). 

• Group 3: If (u,v) G T, m,v G" W, and £b u (u) = ( v ) f° r some b u ,b v G {0,1}, then there is a clause 
{u ^ b u \J v ^ b v ) (e.g., if ^o(«) = £\(v), then the clause is («Vv)). 

• Group 4: If (u,v) G T, m G W, v G" W, and ^(v) = m for some b G {0, 1}, then there is a clause (v ^ 

This completes the description of 0. Note that no clause is introduced for pairs (u,v) G T with u,v G W, 
but these pairs are automatically separated by a solution that is a multiway cut of W. Furthermore, we can 
assume that W induces a independent set, otherwise there is no solution. 

We show first that if / has a shadowless solution S, then removing the corresponding variables of 
makes it satisfiable. As S is shadowless and it is a multiway cut of W, every vertex of G \ S is in the same 
component as exactly one of £q{v) and ^i(v); let the value of variable v be b if vertex v is in the same 
component as £b{v). It is clear that this assignment satisfies the clauses in the first two groups. Consider 
a clause (u / i„Vv / b v ) from the third group. This means that (u,v) G T and £b u (u) = £b v {v) = w G W. 
If this clause is not satisfied, then u = b u and v = b v . By the way the assignment was defined, this is only 
possible if u is in the same component of G \ S as £b u (u) = w and v is in the same component of G \ S as 
£b v (v) = w. Therefore, u and v are in the same component of G \ S, contradicting the assumption that S is a 
solution of /. Clauses in Group 4 can be checked similarly. 

We have shown that can be made satisfiable by the deletion of p variables. By Theorem 15.11 we can 
find such a set S' of variables time 0*(4 P ). To complete the proof, we show that such a set S' corresponds to 
a (not necessarily shadowless) solution of /. Let us show first that S' is a multiway cut of W. Suppose that 
there is a path P connecting wq,w\ G W in G \ S'. We can assume that the internal vertices of P are disjoint 
from W, i.e., they are in one component C of G \ W with two legs. Thus there is a path P' from a neighbor 
vo of wo to a neighbor vi of w\ in C\S'. Suppose without loss of generality that £q(C) = wq and £\{C) = w\. 
As the clauses in Group 1 are satisfied, every variable of P' has the same value. However, because of the 
clauses in Group 2, we have x Vo = and x Vl = 1 , a contradiction. Therefore, we can assume that 5' is a 
multiway cut of W. 

Suppose now that there is some (u,v) G T such that i/,v^f are in the same component of G\S'; let P 
be a u — v path in G \ S'. As W is a multicut of T, it is clear that P goes through at least one vertex of W. 
We have seen that S' is a multiway cut of W, thus P goes through exactly one vertex of W. Let P = P\wP2 
for some path P\ that is fully contained in the component of G \ W containing u and path P2 fully contained 
in the component containing v. Let b u ,b v G {0, 1} be such that £b u {u) = £b v (v) = w. Group 1 ensures that 
every variable of Pi has the same value and Group 2 ensures that the last variable of Pi has value b u , thus 
u = b u . A similar argument shows that v = b v . However, this means that clause [u^b u \l v ^ v u ) of Group 
3 is not satisfied, a contradiction. Finally, a similar argument shows that the clauses in Group 4 ensure that 
pairs (u,v) G T with u G W, v W are separated. □ 

6 Hardness of Directed Multicut 

We prove that DIRECTED EDGE MULTICUT is W[l]-hard parameterized by the solution size, thus it is not 
fixed-parameter tractable (assuming the widely-held complexity hypothesis FPT 7^ W[l]). Recall that the 
edge and vertex versions are equivalent, thus the hardness result holds for both versions. The proof below 
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Figure 4: Part of a connection between gadgets G,j and Gjj in the proof of Theorem |6.1| The dashed edges 
represent terminal pairs. 



proves the hardness result for the weighted version of the problem, where each edge has a positive integer 
weight, and the task is to find a multicut with total weight at most p. If the weights are polynomial in the 
size of the input (which is true in the proof), then the weighted version can be reduced to the unweighted 
version by introducing parallel edges. Thus the proof proves the hardness of the unweighted version as well. 
For notational convenience, we allow edges with weight oo; such edges can be easily replaced by edges with 
sufficiently large finite weight. 

Theorem 6.1. Directed Edge Multicut is W[l]-hard parameterized by the size p of the cutset. 

Proof. We prove hardness for the weighted version of the problem by parameterized reduction from Clique. 
Let G be a graph with m edges and n vertices where a clique of size t has to be found. We construct an in- 
stance of Directed Edge Multicut containing t(t — 1) gadgets: for each ordered pair (1 < i,j < t, 
i 7^ j) there is a gadget Gij. Intuitively, each gadget G ; j has 2m possible states and a state represents 
an ordered pair (v,-,vy) of adjacent vertices. We would like to ensure that the gadgets describe a £-clique 
{vi, .. . ,v t } in the sense that G,-j represents the pair (v,-,V/). In order to enforce this interpretation, we need 
to connect the gadgets in a way that enforces two properties: 

(1) if G,-j represents (v;,v 7 ), then Gjj represents (v ; -,v;) and 

(2) if Gij represents (v,-,v/) and Gij represents (ui,uj), then v* = »,-. 

(Note that it follows from these two conditions that if Gij and G?j represent (v,, Vj) and (ui,uj), respectively, 
then vj = uj.) 

Let us identify the vertices of G with the integers 0, 1 and let us define l(x,y) = xn + y, which 

is a bijective mapping from {0, ... ,n — 1} x {0, . . . ,n — 1} to {0, ... ,n 2 — 1}. The gadget Gu has n 2 + 1 

vertices w^j, . . . , wfj. Let D := 2t 2 . For every < s < n 2 , there is an edge w- jW^ 1 whose weight is D 
if is a pair (x,y) such that x and y are adjacent in G, and °° otherwise. Furthermore, there is an 

additional edge wfjW®j with weight oo. The DIRECTED EDGE MULTICUT instance contains the terminal 

pair (w^j, wfj), which means that at least one of the edges wf^wfj" having finite weight has to be in the 

multicut. If the multicut contains exactly one such edge wfj, wfj in the gadget, then we say that the gadget 
represents the pair l~ l (s). We set p := t(t — l)D + t + t(t — l)/2 to be the maximum weight of the multicut. 
Since p < t (t — 1 )D + D, a multicut of weight at most p contains exactly one edge of weight D from each 
gadget, implying that each gadget represents some pair. 
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For every 1 < i < j < t, we connect Gjj and Gjj in a way that ensures that if G,-j represents the 
pair (x,y), then G ;J - represents the pair (y,x) (see Fig. 5]). More precisely, we show that if the multicut 
contains exactly one edge of the connection and G,j (resp., Gjj) represents the pair (x,y) (resp., (x' ,y')), 
then x = y and y = x' . For every ordered pair (x,y) of adjacent vertices of G, let us introduce two new 

vertices aff \ b\ x j y \ and the directed edge af^,bf^ having weight 1. Furthermore, let us add the edges 
\v} x ' y ^af^ and w^^aff^ having weight oo. Finally, let us add the terminal pairs (w)^ y ^ +l ,b^f^) and 
(w 1 ^^ 1 ,b\ x j y ^)- Observe that if Gjj represents (x,y), then w-y (and hence aff^) is reachable from 
w ( -^' y ' +1 , which means that the multicut has to contain the edge af^ , b^jj . Similarly, if Gjj represents 

l ( x 1 y f ) fv' x f ) l(x ! V) I 1 

(x,y), then wy- (and hence a\- ) is reachable from w- ; , which means that the multicut has to 

i y i x ') f/V) 

contain the edge a-y ,6," ■' . If the multicut contains only one edge of the connection between Gjj and 
Gjj, the two edges must coincide, and we have x = y',y = x'. 

For every 1 < i < t and < x < n, we introduce two new vertices cf and df and connect them with the 

edge c\d\ having weight 1. For every 1 < j < t, i ^ j, < x < n, we add an edge w-^' ^ having weight 
oo and a terminal pair (w l ^'J +1 ' \df ). This completes the description of the reduction. Note that if G,-,y 
represents (x,y), then l(x,0) < l(x,y) < l(x+ 1,0) implies that w^J' ^ is reachable from ;* + , which 

means that the edge cfdf has to be in the cut to prevent df from being reachable from x\^ +l,Q> \ 

Suppose there is a multicut of weight at most p. This means that the multicut contains at most t(t — 1) 
edges of weight D, thus each gadget G/j contains exactly one edge of weight D, i.e., each gadget represents 
some pair (x,y). As discussed in the previous two paragraphs, if Gjj represents (x, y), then cjdf is in the 

multicut. Furthermore, depending on whether i < j or i > j holds, either af'^b'f^ or a^b^f , respectively, 
is in the multicut as well. If the weight of the multicut is at most p, then the total weight of these edges is 
at most t +t(t — 1), which is only possible if these edge coincide in every possible way and it follows that 
properties (1) and (2) hold. Therefore, there are distinct vertices vi, . . . , v t such that gadget G, j represents 
(v{,Vj), which implies that vi, .. ., v t is a clique in G. 

For the other direction, suppose that vi, . .., v t is a clique in G. Let us consider the multicut that contains 
the following edges: 

l(v;,V;) l(v;,V;) + l c , ^ . . ^ . . , . 

• Wjj Wfj 1 for every 1 <i,j <t,if^ j, 

• a^y^bff^ for every 1 < i < j < t, and 

• cfdf 1 for every 1 < i < t. 

The total weight of these edges is exactly p. The edges in the first group ensure that w"j is not reachable 
from w { -j for any i, j. For some i < j and adjacent vertices x and y, consider a terminal pair (w,- ^ , b\ j ^). 

If (x,y) 7^ (vj,vj), then edge w)^j" v w]j 1 of the multicut ensures that (and hence b 1 /*^) is not 

reachable from w)^ y ^ +l . If (x,y) = (v,,v ; ), then edge afj' Vj ^bfj' Vj ^ is in the multicut, again disconnecting 
this terminal pair. For i > j, an analogous argument shows that terminal pair {w\ j ,b l j y i ' x ') for every x,y 

is disconnected. Consider now the terminal pair (w]^f +1 '°\ df) for some 1 < i,j < t, i ^ j, < x < n. If 

> 

x / Vi, then i(v,,v ; ) is either less than i(x,0) or at least l(x + 1,0), thus the edge w ! -y'" v ^w ! -y' ! ' v of the 
multicut ensures that w] ^ (and hence df) is not reachable from w) ^ +1,0 \ On the other hand, if x = v,-, then 
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the edge c^d}' is in the third group of the multicut. Thus we have shown that if there is a clique of size t in 
G, then there is a multicut of size at most p. 

□ 
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A Important separators 

First we state without proof some properties of Definition [3/7] that are easy to see: 

Proposition A.l. Let G be a graph, X,Y C V(G) be two disjoint sets of vertices, and S be an important 
X — Y separator. 

1. For every v £ S, the set S \ {v} is an important X — Y separator in G\v. 

2. If S is anX' — Y separator for some X' D X, then S is an important X' — Y separator. 

Proof ( of Lemma 1X51 ). We prove by induction on 2p — A that there are at most 2 2p ~ x important X — Y 
separators of size at most p, where A is the size of the smallest X — Y separator. If A > p, then there is no 
X — Y separator of size p, and therefore the statement holds if 2p — A < 0. Also, if A = and p > 0, then 
there is a unique important X — Y separator of size at most p: the empty set. 

If S is an X — Y, then we denote by K$ the union of every component of G \ S intersecting X. First we 
show the well-known fact that there is a unique X — Y separator S* of size A such that K$* is inclusionwise 
maximal, i.e., we have K$ C K$* for every other X — Y separator S of size A. Suppose that there are two 
separators 5' and 5"' such that Kg and K S " are incomparable and inclusionwise maximal. Let us define the 
function y(Z) = \N(Z)\. It is well-known that 7 is submodular, that is, 

7(A) + 7(B) > y(A U B) + y(A n B) 

for every A,BC V(G). In particular, the submodularity of gamma implies that 

y{K s ,) + y(K s ») > y{K s , U K s „) + y(K s , n K s „) . 

The left hand side is exactly 2A, while the second term of the right hand side is at least A (as N(K s >r\Ks") 
is anZ — Y separator). Therefore, y{Ks' UXs") < A. This means that N(Ks>LIK s ii) is also a minimum X — Y 
separator, contradicting the maximality of S' and 5"'. 

Next we show that for every important X — Y separator S, we have K s * C K s . Suppose this is not true for 
some 5. We use submodularity again: 

y{K s * ) +r(K s ) > y(K s , U K S ) + Y(K S , n K s ) . 

By definition, 7(^5*) = A, and N(Ks* nK$) is an X — Y separator, hence 7(^5* r\K$) > A. This means 
that y{Ks* UKs) < y(Ks)- However this contradicts the assumption that S is an important X — Y separator: 
N(K$* UK$) is an X — Y separator not larger than 5, but Ks* U Ks is a proper superset of Ks (as Ks* is not a 
subset of #s by assumption). 
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We have shown that for every important separator S, the set K$ contains K$* . Let v G 5* be an arbitrary 
vertex of S* (note that A > 0, hence 5* is not empty). An important X — Y separator S of size at most p 
either contains v or not. If 5 contains v, then S \ {v} is an important X — Y separator in G \ v of size at most 
p' := p— 1 (Prop. lATTl fTI)). As v ^X,Y, the size A' of the minimum X — Y separator in G\v is at least A — 1. 
Therefore, 2p' — X' <2p — X and the induction hypothesis implies that there are at most 2 2p ~^ < 2 lp ~ x ~ x 
important X — Y separators of size p' in G\ {v}, and hence at most that many important X — Y separators of 
size pinG that contain v. 

Let us count now the important X — Y separators not containing v. Note that by the minimality of S* , 
vertex v of S* has a neighbor in K$* . We have seen that K$* C K$ for every such X — Y separator S. As 
v g" S and v has a neighbor in Ks, even Ks* U {v} C Ks is true. Let X' = Ks* U {v} ; it follows that S is a 
X' — Y separator and in fact an important X' — Y separator by Prop. IA.ltl2l) . There is no X' — Y separator S 
of size A: such a set S would be an X — Y separator of size A as well, with Ks* U {v} C K$, contradicting 
the maximality of 5*. Thus the minimum size A' of an X' — Y separator is greater than A. It follows 
by the induction assumption that the number of important X' — Y separators of size at most p is at most 
2 2 P -A' < 2 2 p- x -\ which is a bound on the number of important X — Y separators of size p in G that does 
not contain v. 

Adding the bounds in the two cases, we get the required bound 2 2p ~^. An algorithm for enumerating 
the important separators follows from the proof: we compute the separator S* in polynomial time and then 
branch on whether v is in the separator or not. □ 



B Deleting variables in Almost 2SAT 

Proof ( of Theorem 15. 1 D . Let x\, . .., x n be the variables of (p. We create a new 2CNF formula <p' on 2n 
variables (1 < i < n, b G {0, 1}). The intended meaning of jcf is that its value is 1 if and only if the value 
of Xi in is b. For every 1 < i < n, let us introduce a clause (x { Vx) ) in formula <p'. For every clause of (j), 
there is a corresponding clause of 0' where literal Xj is replaced by literal x) and literal x,- is replaced by 
(e.g., (xiVxj) is replaced by (x? Vxy). 

We claim that there is a set X of variables in such that \ X is satisfiable if and only if there is a set 
X' (\X\ = \X'\) of clauses in <p' such that <p'\X' is satisfiable. As the existence of such a X' can be tested by 
the algorithm of |[36l in time 0*(4 k ), the theorem follows from this claim. 

Suppose first that there is such a set X of variables in 0; let / be a satisfying assignment of \X. Let 
X' contain the clauses (xf Vx,- ) for every x,- G X. Let us define /'(xf) = f'(xj) = 1 if x,- G X, and for every 
Xj $lX, let /'(xf) = 1 if and only if /(x,-) = b. It is straightforward to verify that f satisfies (j)'\X'. 

For the other direction, let us suppose that X' is a set of clauses such that <p'\X' is satisfiable and let /' 
be a satisfying assignment of (j)' \X'. The important observation is that we can assume that X' contains only 
clauses of the form (xf Vx - ). To see this, observe that variables x\ appear negatively only in the clauses of 
this form. Thus if X' contains a clause C that where xf appears positively, then we can replace C in X' by 
(xf Vxj ) and set /(xf ) = f(xj) = 1. Let X contain a variable x, if (xf Vx|) is in X'. It is easy to verify that 
defining /(x,) = f(xj ) gives a satisfying assignment of \ X. □ 



29 



